
' ' . Errors Corrected by the STIC sjAsms Branch 
Seriaf Number: 

|~| / Changed a file from non-ASCII to ASCII ^ ^ C (J 

[3f Changed the margins in cases where the sequence text was Vrapped" dov^ to the next line 
I I Edited a format error in the Current Application Data section, specifically; 

I I Edited the Cun-ent Application Data section with the actual current number. The number inputted by the 
applicant was □ the prior application data; or □ other . 

f I Added the mandatory heading and subheadings for "Current Application Data". 

Edited the "Number of Sequences" field. The applicant spelled out a number instead of using an integer. 

I — I Changed the spelling of a mandatory field (the headings or subheadings), specifically: 

I I Con-ected the SEQ ID NO when obviously incorrect. The sequence numbers that were edited were: 

Q Inserted or con-ected a nudeic number at the end of a nucleic line. SEQ ID NO's edited: 

I — I Con-ected subheading placement. All responses must be on the same line as each subheading. If the 

— applicant placed a response below the subheading, this was moved to its appropriate place. 

I — [ Inserted colons after headings/subheadings. Headings edited included: 

[ — I Deleted extra, invalid, headings used by an applicant, specifically: 

I — I Deleted: □ non-ASCII "garbage" at the beginning/end of files; □ secretary initials/filename at end of file; 

— □ page numbers throughout text; □ other invalid text, such as \ . 

I I Inserted mandatory headings, specifically: 

I I Corrected an obvious error in the response, spedfically: cp "-rx 

. . . . ^ . -iD. 

I I Edited identifiers where upper case is used but lower case is required, or vice versa. :p ^ \^ 

I I Con-ected an error in the Number of Sequences field, specifically: 

I I A "Hard Page Break" code was inserted by the applicant. All occun-ences had to be deleted. 

I I Deleted e/K//ng stop codon in amino add sequences and adjusted the "(A)Length:" field accordingly (en-or 
due to a Patentin bug). Sequences con-ected: 

I I Other: 



^Examiner: -Uie^ove corrections must be communicated to the applicant in the first Office 
ActloS DO NOT send a copy of this form, ^^^^^ 
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4 (1) GENERAL INFORMATION: 

6 (i) APPLICANT: Adminis . of Tulane Educational, Fund 

7 Philipp, Mario T. 

9 (ii) TITLE OF INVENTION: Surface Antigens and Proteins Useful in 

10 Compositions for the Diagnosis and Prevention of Lyme 

11 Disease 
13 (iii) NUMBER OF SEQUENCES: 14 

15 (iv) CORRESPONDENCE ADDRESS: 

16 (A) ADDRESSEE: Howson and Howson 

17 (B) STREET: Spring House Corporate Cntr., P.O. Box 457 

18 (C) CITY: Spring House 

19 (D) STATE: Pennsylvania 

20 (E) COUNTRY: USA 

21 (F) ZIP: 19477 

2 3 (V) COMPUTER READABLE FORM: 

24 (A) MEDIUM TYPE: Floppy disk 

25 (B) COMPUTER: IBM PC compatible 

2 6 (C) OPERATING SYSTEM: PC -DOS/MS -DOS 

27 (D) SOFTWARE: PatentIn Release #1.0, Version #1.30 

C--> 29 (vi) CURRENT APPLICATION DATA: 

C--> 30 (A) APPLICATION NUMBER: US/09/4 4 5,8 03 

C--> 31 (B) FILING DATE: 13-Dec-1999 

32 (C) CLASSIFICATION: 

34 (vii) PRIOR APPLICATION DATA: 

35 (A) APPLICATION NUMBER: US 60/051,271 

36 (B) FILING DATE: 30-JUN-1997 

38 (viii) ATTORNEY/AGENT INFORMATION: 

39 (A) NAME: Bak, Mary E. 

40 (B) REGISTRATION NUMBER: 31,215 

41 (C) REFERENCE/DOCKET NUMBER: TUL2APCT 
4 3 (ix) TELECOMMUNICATION INFORMATION: 

44 (A) TELEPHONE: 215-540-9200 

45 (B) TELEFAX: 215-540-5818 
4 8 (2) INFORMATION FOR SEQ ID NO: 1: 

50 (i) SEQUENCE CHARACTERISTICS: 

51 (A) LENGTH: 1047 base pairs 

52 (B) TYPE: nucleic acid 

53 (C) STRANDEDNESS : double 

54 (D) TOPOLOGY: unknown 
56 (ii) MOLECULE TYPE: cDNA 

59 (ix) FEATURE: 

60 (A) NAME/KEY: CDS 

61 (B) LOCATION: 1..1047 

64 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 

66 AAG AAT AAT GAT CAT GAT AAT CAT AAG GGG ACT GTT AAG AAT GCT GTT 4 8 

67 Lys Asn Asn Asp His Asp Asn His Lys Gly Thr Val Lys Asn Ala Val 

68 1 5 10 15 
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70 GAT 


ATG 


GCA 


AAG 


GCC 


GCT 


GAG 


GAA 


GCT 


GCA 


AGT 1 


GCT 


GCA 


AGT 


GCT 


GCT 


96 


71 Asp 


Met 


Ala 


Lys 


Ala 


Ala 


Glu 


Glu 


Ala 


Ala 


Ser , 


Ala 


Ala 


Ser 


Ala 


Ala 




72 






20 










25 










30 








74 ACT 


GGT 


AAT 


GCA 


GCG 


ATT 


GGG 


GAT 


GTT 


GTT 


AAG . 


AAT 


AGT 


GGG 


GCA 


GCA 


144 


7 5 Thr 


Gly 


Asn 


Ala 


Ala 


He 


Gly 


Asp 


Val 


Val 


Lys 


Asn 


Ser 


Gly 


Ala 


Ala 




76 




35 










40 










45 










78 GCA 


AAA 


GGT 


GGT 


GAG 


GCG 


GCG 


AGT 


GTT 


AAT 


GGG , 


ATT 


GCT 


AAG 


GGG 


ATA 


192 


79 Ala 


Lys 


Gly 


Gly 


Glu 


Ala 


Ala 


Ser 


Val 


Asn 


Gly 


He 


Ala 


Lys 


Gly 


He 




80 


50 










55 










60 












82 AAG 


GGG 


ATT 


GTT 


GAT 


GCT 


GCT 


GGA 


AAG 


GCT 


GAT ' 


GCG 


AAG 


GAA 


GGG 


AAG 


240 


8 3 Lys 


Gly 


He 


Val 


Asp 


Ala 


Ala 


Gly 


Lys 


Ala 


Asp 


Ala 


Lys 


Glu 


Gly 


Lys 




84 65 










70 










75 










80 




86 TTG 


GAT 


GCT 


ACT 


GGT 


GCT 


GAG 


GGT 


ACG 


ACT 


AAC ' 


GTG 


AAT 


GCT 


GGG 


AAG 


288 


87 Leu 


Asp 


Ala 


Thr 


Gly 


Ala 


Glu 


Gly 


Thr 


Thr 


Asn 


val 


Asn 


Ala 


Gly 


Lys 




88 








85 










90 










95 






90 TTG 


TTT 


GTG 


AAG 


AGG 


GCG 


GCT 


GAT 


GAT 


GGT 


GGT 1 


GAT 


GCA 


GAT 


GAT 


GCT 


336 


91 Leu 


Phe 


Val 


Lys 


Arg 


Ala 


Ala 


Asp 


Asp 


Gly 


Gly , 


Asp 


Ala 


Asp 


Asp 


Ala 




92 






100 










105 










110 








94 GGG 


AAG 


GCT 


GCT 


GCT 


GCG 


GTT 


GCT 


GCA 


AGT 


GCT 1 


GCT 


ACT 


GGT 


AAT 


GCA 


384 


95 Gly 


Lys 


Ala 


Ala 


Ala 


Ala 


Val 


Ala 


Ala 


Ser 


Ala , 


Ala 


Thr 


Gly 


Asn 


Ala 




96 




115 










120 










125 










98 GCG 


ATT 


GGA 


GAT 


GTT 


GTT 


AAT 


GGT 


GAT 


GTG 


GCA , 


AAA 


GCA 


AAA 


GGT 


GGT 


432 


99 Ala 


He 


Gly 


Asp 


Val 


Val 


Asn 


Gly 


Asp 


Val 


Ala 


Lys 


Ala 


Lys 


Gly 


Gly 




100 


130 










135 










140 












102 GAT 


GCG 


GCG 


AGT 


GTT 


AAT 


GGG 


ATT 


GCT 


AAG 


GGT 


ATA 


AAG 


GGG 


ATT 


GTT 


480 


103 Asp 


Ala 


Ala 


Ser 


Val 


Asn 


Gly 


He 


Ala 


Lys 


Gly 


He 


Lys 


Gly 


He 


Val 




104 145 










150 










155 










160 




106 GAT 


GCT 


GCT 


GAG 


AAG 


GCT 


GAT 


GCG 


AAG 


GAA 


GGG 


AAG 


TTG 


AAT 


GCT 


GCT 


528 


107 Asp 


Ala 


Ala 


Glu 


Lys 


Ala 


Asp 


Ala 


Lys 


Glu 


Gly 


Lys 


Leu 


Asn 


Ala 


Ala 




108 








165 










170 










175 






110 GGT 


GCT 


GAG 


GGT 


ACG 


ACT 


AAC 


GCG 


GAT 


GCT 


GGG 


AAG 


TTG 


TTT 


GTG 


AAG 


576 


111 Gly 


Ala 


Glu 


Gly 


Thr 


Thr 


Asn 


Ala 


Asp 


Ala 


Gly 


Lys 


Leu 


Phe 


Val 


Lys 




112 






180 










185 










190 








114 AAT 


GCT 


GGT 


AAT 


GTG 


GGT 


GGT 


GAA 


GCA 


GGT 


GAT 


GCT 


GGG 


AAG 


GCT 


GCT 


624 


115 Asn 


Ala 


Gly 


Asn 


Val 


Gly 


Gly 


Glu 


Ala 


Gly 


Asp 


Ala 


Gly 


Lys 


Ala 


Ala 




116 




195 










200 










205 










118 GCT 


GCG 


GTT 


GCT 


GCT 


GTT 


AGT 


GGG 


GAG 


CAG 


ATA 


TTA 


AAA 


GCG 


ATT 


GTT 


672 


119 Ala 


Ala 


Val 


Ala 


Ala 


Val 


Ser 


Gly 


Glu 


Gin 


He 


Leu 


Lys 


Ala 


He 


Val 




120 


210 










215 










220 












122 CAT 


GCT 


GCT 


AAG 


GAT 


GGT 


GGT 


GAG 


AAG 


CAG 


GGT 


AAG 


AAG 


GCT 


GCG 


GAT 


720 


123 His 


Ala 


Ala 


Lys 


Asp 


Gly 


Gly 


Glu 


Lys 


Gin 


Gly 


Lys 


Lys 


Ala 


Ala 


Asp 




124 225 










230 










235 










240 




126 CGT 


AC A 


AAT 


CCC 


ATT 


GAC 


GCG 


GCT 


ATT 


GGG 


GGT 


GCG 


GGT 


GAT 


AAT 


GAT 


768 


127 Arg 


Thr 


Asn 


Pro 


He 


Asp 


Ala 


Ala 


He 


Gly 


Gly 


Ala 


Gly 


Asp 


Asn 


Asp 




128 








245 










250 










255 






130 GCT 


GCT 


GCG 


GCG 


TTT 


GCT 


ACT 


ATG 


AAG 


AAG 


GAT 


GAT 


CAG 


ATT 


GCT 


GCT 


816 


131 Ala 


Ala 


Ala 


Ala 


Phe 


Ala 


Thr 


Met 


Lys 


Lys 


Asp 


Asp 


Gin 


He 


Ala 


Ala 




132 






260 










265 










270 








134 GCT 


ATG 


GTT 


CTG 


AGG 


GGA 


ATG 


GCT 


AAG 


GAT 


GGG 


CAA 


TTT 


GCT 


TTG 


AAG 


864 
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135 


Ala 


Met 


Val 


Leu 


Arg 


Gly 


Met 


Ala 


Lys 


Asp 


Gly 


Gin 


Phe 


Ala 


Leu 


Lys 




136 






275 










280 










285 










138 


GAT 


GCT 


GCT 


GCT 


GCT 


CAT 


GAA 


GGG 


ACT 


GTT 


AAG 


AAT 


GCT 


GTT 


GAT 


ATA 


912 


139 


Asp 


Ala 


Ala 


Ala 


Ala 


His 


Glu 


Gly 


Thr 


Val 


Lys 


Asn 


Ala 


Val 


Asp 


He 




140 




290 










295 










300 












142 


ATA 


AAG 


GCT 


GCT 


GCG 


GAA 


GCT 


GCA 


AGT 


GCT 


GCA 


AGT 


GCT 


GCT 


ACT 


GGT 


960 


143 


He 


Lys 


Ala 


Ala 


Ala 


Glu 


Ala 


Ala 


Ser 


Ala 


Ala 


Ser 


Ala 


Ala 


Thr 


Gly 




144 


305 










310 










315 










320 




146 


AGT 


GCA 


GCA 


ATT 


GGG 


GAT 


GTT 


GTT 


AAT 


GGT 


AAT 


GGA 


GCA 


ACA 


GCA 


AAA 


1001 


147 


Ser 


Ala 


Ala 


He 


Gly 


Asp 


Val 


Val 


Asn 


Gly 


Asn 


Gly 


Ala 


Thr 


Ala 


Lys 




148 










325 










330 










335 






150 


GGT 


GGT 


GAT 


GCG 


AAG 


AGT 


GTT 


AAT 


GGC 


ATT 


GCT 


AAG 


GGA 








104' 


151 


Gly 


Gly 


Asp 


Ala 


Lys 


Ser 


Val 


Asn 


Gly 


He 


Ala 


Lys 


Gly 










152 








340 










345 


















155 


(2) 


INFORMATION 


FOR 


SEQ 


ID NO: 2: 


















157 




(i) SEQUENCE CHARACTERISTICS: 


















158 






(A) LENGTH: 349 amino 


acids 
















159 






(B) TYPE: 


amino acid 




















160 






(D) TOPOLOGY: 


linear 




















162 




(ii) MOLECULE TYPE: 


protein 




















164 




(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2; 














166 


Lys 


Asn 


Asn 


Asp 


His 


Asp 


Asn 


His 


Lys 


Gly 


Thr 


Val 


Lys 


Asn 


Ala 


Val 




167 


1 








5 










10 










15 






169 


Asp 


Met 


Ala 


Lys 


Ala 


Ala 


Glu 


Glu 


Ala 


Ala 


Ser 


Ala 


Ala 


Ser 


Ala 


Ala 




170 








20 










25 










30 








172 


Thr Gly 


Asn 


Ala 


Ala 


He 


Gly Asp 


Val 


Val 


Lys 


Asn 


Ser 


Gly 


Ala 


Ala 




173 






35 










40 










45 










175 


Ala 


Lys 


Gly 


Gly 


Glu 


Ala 


Ala 


Ser 


Val 


Asn 


Gly 


He 


Ala 


Lys 


Gly 


He 




176 




50 










55 










60 












178 


Lys 


Gly 


He 


Val 


Asp 


Ala 


Ala Gly 


Lys 


Ala 


Asp 


Ala 


Lys 


Glu 


Gly 


Lys 




179 


65 










70 










75 










80 




181 


Leu 


Asp 


Ala 


Thr 


Gly 


Ala 


Glu 


Gly 


Thr 


Thr 


Asn 


Val 


Asn 


Ala 


Gly 


Lys 




182 










85 










90 










95 






184 


Leu 


Phe 


Val 


Lys 


Arg 


Ala 


Ala 


Asp 


Asp 


Gly 


Gly 


Asp 


Ala 


Asp 


Asp 


Ala 




185 








100 










105 










110 








187 


Gly 


Lys 


Ala 


Ala 


Ala 


Ala 


Val 


Ala 


Ala 


Ser 


Ala 


Ala 


Thr 


Gly 


Asn 


Ala 




188 






115 










120 










125 










190 


Ala 


He 


Gly 


Asp 


val 


Val 


Asn 


Gly 


Asp 


Val 


Ala 


Lys 


Ala 


Lys 


Gly 


Gly 




191 




130 










135 










140 












193 


Asp 


Ala 


Ala 


Ser 


Val 


Asn 


Gly 


He 


Ala 


Lys 


Gly 


He 


Lys 


Gly 


He 


Val 




194 


145 










150 










155 










160 




196 


Asp 


Ala 


Ala 


Glu 


Lys 


Ala 


Asp 


Ala 


Lys 


Glu 


Gly 


Lys 


Leu 


Asn 


Ala 


Ala 




197 










165 










170 










175 






199 


Gly 


Ala 


Glu 


Gly 


Thr 


Thr 


Asn 


Ala 


Asp 


Ala 


Gly 


Lys 


Leu 


Phe 


Val 


Lys 




200 








180 










185 










190 








202 


Asn 


Ala 


Gly 


Asn 


Val 


Gly 


Gly 


Glu 


Ala 


Gly 


Asp 


Ala 


Gly 


Lys 


Ala 


Ala 




203 






195 










200 










205 










205 


Ala 


Ala 


Val 


Ala 


Ala 


Val 


Ser 


Gly 


Glu 


Gin 


He 


Leu 


Lys 


Ala 


He 


Val 




206 




210 










215 










220 













file://C:\CRF3\OutholcfvVsrI445803.htm 



5/22/00 



Page 4 of 7 



RAW SEQUENCE LISTING DATE: 05/22/2000 

PATENT APPLICATION: US/09/44 5,803 TIME: 18:53:32 



Input Set : A:\Pto.ainc 

Output Set: N:\CRF3\05222000\I445803.raw 



208 


His 


Ala 


Ala 


Lys 


Asp 


Gly 


Gly 


Glu 


Lys 


Gin 


Gly 


Lys 


Lys 


Ala 


Ala 


Asp 


209 


225 










230 










235 










240 


211 


Arg 


Thr 


Asn 


Pro 


He 


Asp 


Ala 


Ala 


He 


Gly 


Gly 


Ala 


Gly 


Asp 


Asn 


Asp 


212 










245 










250 










255 




214 


Ala 


Ala 


Ala 


Ala 


Phe 


Ala 


Thr 


Met 


Lys 


Lys 


Asp 


Asp 


Gin 


He 


Ala 


Ala 


215 








260 










265 










270 






217 


Ala 


Met 


Vai 


Leu 


Arg 


Gly 


Met 


Ala 


Lys 


Asp 


Gly Gin 


Phe 


Ala 


Leu 


Lys 


218 






275 










280 










285 








220 


Asp 


Ala 


Ala 


Ala 


Ala 


His 


Glu 


Gly 


Thr 


Val 


Lys 


Asn 


Ala 


val 


Asp 


He 


221 




290 










295 










300 










223 


lie 


Lys 


Ala 


Ala 


Ala 


Glu 


Ala 


Ala 


Ser 


Ala 


Ala 


Ser 


Ala 


Ala 


Thr 


Gly 


224 


305 










310 










315 










320 


226 


Ser 


Ala 


Ala 


lie 


Gly 


Asp 


Val 


Val 


Asn 


Gly 


Asn 


Gly 


Ala 


Thr 


Ala 


Lys 


227 










325 










330 










335 




229 


Gly 


Gly 


Asp 


Ala 


Lys 


Ser 


Val 


Asn 


Gly 


He 


Ala 


Lys 


Gly 









230 340 345 

2 32 (2) INFORMATION FOR SEQ ID NO: 3: 

234 (i) SEQUENCE CHARACTERISTICS: 

235 (A) LENGTH: 283 base pairs 

236 (B) TYPE: nucleic acid 

2 37 (C) STRANDEDNESS : double 

2 38 (D) TOPOLOGY: unknown 

24 0 (ii) MOLECULE TYPE: DNA (genomic) 

245 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

24 7 GCCGCTGGAT GGTGGTGAGA AGCAGGGTAA GAAGGCTGCG GATCGTACAA ATCCCATTGA 60 

249 CGCGGCTATT GGGGGTGCGG GTGATAATGA TGCTGCTGCG GCGTTTGCTA CTATGAAGAA 120 

2 51 GGATGATCAG ATTGCTGCTG CTATGGTTCT GAGGGGAATG GCTAAGGATG GGCAATTTGC 180 

2 53 TTTGAAGGAT GCTGCTGCTG CTCATGAAGG GACTGTTAAG AATGCTGTTG ATATAATAAA 24 0 

2 55 GGCTGCTGCG GAAGCTGCAA GTGCTGCAAG TGCTGCTACT GGT 283 

2 57 (2) INFORMATION FOR SEQ ID NO: 4: 



2 59 (i) SEQUENCE CHARACTERISTICS: 

260 (A) LENGTH: 233 base pairs 

261 (B) TYPE: nucleic acid 

262 (C) STRANDEDNESS: double 

263 (D) TOPOLOGY: unknown 

265 (ii) MOLECULE TYPE: DNA (genomic) 

270 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 



272 TTTATTATAT CAACAGATTC TTAACAGTCC CTTCATGAGC AGCAGCAGCA TCCTTCAAAG 60 

274 CAAATTGCCC ATCCTTAGCC ATTCCCCTCA GAACCATAGC AGCAGCAATC TGATCATCCT 120 

276 TCTTCATAGT AGCAAACGCC GCAGCAGCAT CATTATCACC CGCACCCCCA ATAGCCGCGT 180 

278 CAATCGGATT TGTACGATCC GCAGCCTTCT TACCCTGCTT CTCACCACCA TCC 233 

280 (2) INFORMATION FOR SEQ ID NO: 5: 



282 (i) SEQUENCE CHARACTERISTICS: 

283 (A) LENGTH: 194 base pairs 

284 (B) TYPE: nucleic acid 

285 (C) STRANDEDNESS: double 

286 (D) TOPOLOGY: unknown 

288 (ii) MOLECULE TYPE: DNA (genomic) 

293 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 



file://C:\CRF3\Outhold\VsrI445803.htm 



5/22/00 



Page 5 of 7 



RAW SEQUENCE LISTING 

PATENT APPLICATION: US/09/44 5,803 



DATE: 05/22/2000 
TIME: 18:53:32 



Input Set : A:\Pto.amc 

Output Set: N:\CRF3\05222000\I445803.raw 



295 CCGTGCAAGC TGGGTTGAAG AAGGTTGGGG ATGTTGTTAA GAATAGTGAG GCAAAAGATG 

297 GTGATGCGGC GAGTGTTAAT GGGATTGCTA AGGGGATAAA GGGGATTGTT GATGCTGCTG 

299 AGAAGGCTGA TGCGAAGGAA GGGAAGTTGG TATGTGGCTG GTGCTGCTGG TGAAACTAAC 

301 AAGGAAGCGG CCGC 

303 (2) INFORMATION FOR SEQ ID NO : 6: 

305 (i) SEQUENCE CHARACTERISTICS: 

306 (A) LENGTH: 369 base pairs 

307 (B) TYPE: nucleic acid 

308 (C) STRANDEDNESS : double 

309 (D) TOPOLOGY: unknown 

311 (ii) MOLECULE TYPE: DNA (genomic) 

316 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

318 GCGGCCGCTT GAGGAAGCTG CAAGTGCTGC AAGTGCTGCT ACTGGTAATG CAGCGATTGG 

320 GGATGTTGTT AAGAATAGTG GGGCAGCAGC AAAAGGTGGT GAGGCGGCGA GTGTTAATGG 

3 22 GATTGCTAAG GGGATAAAGG GGATTGTTGA TGCTGCTGGA AAGGCTGATG CGAAGGAAGG 

3 24 GAAGTTGGAT GCTACTGGTG CTGAGGGTAC GACTAACGTG AATGCTGGGA AGTTGTTTGT 

326 GAAGAGGGCG GCTGATGATG GTGGTGATGC AGATGATGCT GGGAAGGCTG CTGCTGCGGT 

328 TGCTGCAAGT GCTGCTACTG GTAATGCAGC GATTGGAGAT GTTGTTAATG GTGATGTGGC 

3 30 AAAACAAAA 

332 (2) INFORMATION FOR SEQ ID NO: 7: 
3 34 (i) SEQUENCE CHARACTERISTICS: 

335 (A) LENGTH: 142 base pairs 

336 (B) TYPE: nucleic acid 

337 (C) STRANDEDNESS: double 
3 38 (D) TOPOLOGY: unknown 

340 (ii) MOLECULE TYPE: DNA (genomic) 

345 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

3 47 AAGGATGGTG ATGATAAGCA GGGTAAGAAG GCTGAGGATG CTACAAATCC GATTGACGCG 

349 GCTATTGGGG GTGCAGGTGC GGGTGCTAAT GCTGCTGCGG CGTTTAATAA TATGAAGAAG 

3 51 GATGATCAGA TTGAGCGGCC GC 

3 53 (2) INFORMATION FOR SEQ ID NO : 8: 

3 55 (i) SEQUENCE CHARACTERISTICS: 

356 (A) LENGTH: 210 base pairs 

357 (B) TYPE: nucleic acid 

358 (C) STRANDEDNESS: double 
3 59 (D) TOPOLOGY: unknown 

3 61 (ii) MOLECULE TYPE: DNA (genomic) 

366 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

3 68 GGTGAAACTA ACAAGGATGC TGGGAAGTTG TTTGTGAAGA AGAATGGTGA TGATGGTGGT 

370 GATGCAGGTG ATGCTGGGAA GGCTGCTGCT GCGGTTGCTG CTGTTAGTGG GGAGCAGATA 

372 TTAAAAGCGA TTGTTGATGC TGCTAAAGAT GGTGATAAGA CGGGGGTTAC TGATGTAAAG 

374 GATGCTACAA ATCCGATTGA CGCGGCTATT 

376 (2) INFORMATION FOR SEQ ID NO : 9: 

378 (i) SEQUENCE CHARACTERISTICS: 

379 (A) LENGTH: 236 base pairs 

380 (B) TYPE: nucleic acid 

381 (C) STRANDEDNESS: double 

382 (D) TOPOLOGY: unknown 

384 (ii) MOLECULE TYPE: DNA (genomic) 
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180 
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60 
120 
180 
210 
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1711 







7 






9 






10 






11 






13 






15 






16 






17 






18 






19 






20 






21 






23 






24 






25 






26 






27 


c- 


-> 


29 


c- 


-> 


30 


c- 


-> 


31 






32 






34 






35 






36 






38 






39 






40 






41 






43 






44 






45 



RAW SEQUENCE LISTING DATE: 05/12/2000 

PATENT APPLICATION : US/09/445,803 TIME: 11:04:23 

Input Set : A:\sequence.lst.txt 

Output Set: N:\CKF3\05122000\I445803.raw 

(1) GENERAL INFORMATION: 

(i) APPLICANT: Admin is . of Tulane Educational, Fund 
Philipp, Mario T. 
(ii) TITLE OF INVENTION: Surface Antigens and Proteins Useful in 

Compositions for the Diagnosis and Prevention of Lyme 
Disease 

(iii) NUMBER OF SEQUENCES: 14 
(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Howson and Howson 

(B) STREET: Spring House Corporate Cntr. , P.O. Box 457 r^/\A» Kl/^* ^/**^wkiw 

(C) CITY: Spring House UOOS NOT UOmpiy 

(D) STATE: Pennsylvania CorrBCtecl DiskettG Needed 

(E) COUNTRY: USA 

(F) ZIP: 19477 

(V) COMPQTER READABLE FORM : ' 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC- DOS/MS -DOS 

(D) SOFTWARE: Patentin Release #1.0, Version #1.30 
(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUflBER: US/09/445,803 

(B) PILING DATE: 13-Dec-1999 

(C) CLASSIFICATION: 
(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 60/051,271 

(B) FILING DATE: 30-JUN-1997 
(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: Bak, Mary E. 

(B) REGISTRATION NUMBER: 31,215 

(C) REFERENCE/DOCKET NUMBER: TUL2APCT 
(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 215-540-9200 

(B) TELEFAX: 215-540-5818 



ERRORED SEQUENCES 

48 (2) INFORliATION FOR SEQ ID NO: 1: 



50 (i) SEQUENCE CHARACTERISTICS: 

51 {A) LENGTH: 1047 base pairs 

52 (B) TYPE: nucleic acid 

5 3 (C) STRANDEDNESS : double 

54 (D) TOPOLOGY: unknown 

56 (ii) MOLECULE TYPE: cDNA 

59 (ix) FEATURE: 

60 (A) NAME/KEY: CDS 

61 (B) LOCATION: 1. .1047 
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RAW SEQUENCE LISTING DATE: 05/12/2000 

PATENT APPLICATION: US/09/44 5,803 TIME: 11:04:23 

Input Set : A:\sequence.lst.txt. 
Output Set: N:\CRF3\05122000\I445803.raw 

64 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 
W--> 66 AAG AAT AAT GAT CAT GAT AAT CAT AAG GGG ACT GTT AAG AAT GCT 
W--> 67 48 . • 

68 Lys Asn Asn Asp His Asp Asn His Lys Gly Thr Val Lys Asn Ala 
W--> 69 1 5 10 15 
W--> 71 GAT ATG GCA AAG GCC GCT GAG GAA GCT GCA AGT GCT GCA ACT GCT 
W--> 72 96*""* 

73 Asp Met Ala Lys Ala Ala Glu Glu Ala Ala Ser Ala Ala Ser Ala 
W--> 74 20 25 30 

W--> 76 ACT GGT AAT GCA GCG ATT GGG GAT GTT GTT AAG AAT AGT GGG GCA 
W--> 77 144 

78 Thr Gly Asn Ala Ala lie Gly Asp Val Val Lys Asn Ser Gly Ala 
W--> 79 35 40 45 

W--> 81 GCA AAA GGT GGT GAG GCG GCG AGT GTT AAT GGG ATT GCT AAG GGG 
W--> 82 192 

83 Ala Lys Gly Gly Glu Ala Ala Ser Val Asn Gly He Ala Lys Gly 
W--> 84 50 55 60 

W--> 86 AAG GGG ATT GTT GAT GCT GCT GGA AAG GCT GAT GCG AAG GRA GGG 
W--> 87 240 

88 Lys Gly He Val Asp Ala Ala Gly Lys Ala Asp Ala Lys Glu Gly 
W--> 89 65 70 75 

W--> 91 TTG GAT GCT ACT GGT GCT GAG GGT ACG ACT AAC GTG AAT GCT GGG 
W--> 92 288 

93 Leu Asp Ala Thr Gly Ala Glu Gly Thr Thr Asn Val Asn Ala Gly 
W--> 94 85 90 95 

W--> 96 TTG TTT GTG AAG AGG GCG GCT GAT GAT GGT GGT GAT GCA GAT GAT 
W--> 97 336 

98 Leu Phe Val Lys Arg Ala Ala Asp Asp Gly Gly Asp Ala Asp Asp 
W--> 99 100 105 110 



w- 


-> 


101 


GGG 


AAG 


GCT 


GCT 


GCT 


GCG 


GTT 


GCT 


GCA 


AGT 


GCT 


GCT 


ACT 


GGT 


AAT 


GCA 


w- 


-> 


102 


384 




































103 


Gly 


Lys 


Ala 


Ala 


Ala 


Ala 


Val 


Ala 


Ala 


Ser 


Ala 


Ala 


Thr 


Gly 


Asn 


Ala 


w- 


-> 


104 






115 










120 










125 








w- 


-> 


106 


GCG 


ATT 


GGA 


GAT 


GTT 


GTT 


AAT 


GGT 


GAT 


GTG 


GCA 


AAA 


GCA 


AAA 


GGT 


GGT 


w- 


-> 


107 


432 




































108 


Ala 


He 


Gly 


Asp 


Val 


Val 


Asn 


Gly 


Asp 


Val 


Ala 


Lys 


Ala 


Lys 


Gly 


Gly 


w- 


-> 


109 




130 










135 










140 










w- 


-> 


111 


GAT 


GCG 


GCG 


AGT 


GTT 


AAT 


GGG 


ATT 


GCT 


AAG 


GGT 


ATA 


AAG 


GGG 


ATT 


GTT 


w- 


-> 


112 


480 




































113 


Asp 


Ala 


Ala 


Ser 


Val 


Asn 


Gly 


He 


Ala 


Lys 


Gly 


He 


Lys 


Gly 


He 


Val 


w- 


-> 


114 


145 










150 










155 










160 


w- 


-> 


116 


GAT 


GCT 


GCT 


GAG 


AAG 


GCT 


GAT 


GCG 


AAG 


GAA 


GGG 


AAG 


TTG 


AAT 


GCT 


GCT 


w- 


-> 


117 


528 




































118 


Asp 


Ala 


Ala 


Glu 


Lys 


Ala 


Asp 


Ala 


Lys 


Glu 


Gly 


Lys 


Leu 


Asn 


Ala 


Ala 


w- 


-> 


119 










165 










170 










175 




w- 


-> 


121 


GGT 


GCT 


GAG 


GGT 


ACG 


ACT 


AAC 


GCG 


GAT 


GCT 


GGG 


AAG 


TTG 


TTT 


GTG 


AAG 


w- 


-> 


122 


576 




































123 


Gly 


Ala 


Glu 


Gly 


Thr 


Thr 


Asn 


Ala 


Asp 


Ala 


Gly 


Lys 


Leu 


Phe 


Val 


Lys 


w- 


-> 


124 








180 










185 










190 








GTT 



Val 
GCT 



Ala 

GCA 

Ala 

ATA 

He 

AAG 

Lys 
80 
AAG 

Lys 

GCT 

Ala 
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RAW SEQUENCE LISTING DATE: 05/12/2000 

PATENT APPLICATION: US/09/445,803 TIME: 11:04:23 

Input Set : A:\sequence. 1st. txt 

Output Set: N:\CRF3\05122000\I445803.raw 



w- 


-> 


126 


AAT 


GCT 


GGT AAT 


GTG 


GGT 


GGT 


GAA 


GCA 


GGT 


GAT 


GCT 


GGG 


AAG 


GCT 


GCT 


w- 


-> 


127 


624 


































128 


Asn 


Ala 


Gly Asn 


Val 


Gly 


Gly 


Glu 


Ala 


Gly 


Asp 


Ala 


Gly 


Lys 


Ala 


Ala 


w- 


-> 


129 






195 








200 










205 








w- 


-> 


131 


GOT 


GCG 


GTT GCT 


GCT 


GTT 


AGT 


GGG 


GAG 


CAG 


ATA 


TTA 


AAA 


GCG 


ATT 


GTT 


w- 


-> 


132 


672 


































133 


Ala 


Ala 


Val Ala 


Ala 


Val 


Ser 


Gly 


Glu 


Gin 


He 


Leu 


Lys 


Ala 


He 


Val 


w- 


-> 


134 




210 








215 










220 










w- 


-> 


136 


CAT 


GCT 


GCT AAG 


GAT 


GGT 


GGT 


GAG 


AAG 


CAG 


GGT 


AAG 


AAG 


GCT 


GCG 


GAT 


w- 


-> 


137 


720 


































138 


His 


Ala 


Ala Lys 


Asp 


Gly 


Gly 


Glu 


Lys 


Gin 


Gly 


Lys 


Lys 


Ala 


Ala 


Asp 


w- 


-> 


139 


225 








230 










235 










240 


w- 


-> 


141 


CGT 


ACA 


AAT CCC 


ATT 


GAC 


GCG 


GCT 


ATT 


GGG 


GGT 


GCG 


GGT 


GAT 


AAT 


GAT 


w- 


-> 


142 


768 


































143 


Arg 


Thr 


Asn Pro 


He 


Asp 


Ala 


Ala 


He 


Gly 


Gly 


Ala 


Gly 


Asp 


Asn 


Asp 


w- 


-> 


144 








245 










250 










255 




w- 


-> 


146 


GCT 


GCT 


GCG GCG 


TTT 


GCT 


ACT 


ATG 


AAG 


AAG 


GAT 


GAT 


CAG 


ATT 


GCT 


GCT 


w- 


-> 


147 


816 


































148 


Ala 


Ala 


Ala Ala 


Phe 


Ala 


Thr 


Met 


Lys 


Lys 


Asp 


Asp 


Gin 


He 


Ala 


Ala 


w- 


-> 


149 






260 










265 










270 






w- 


-> 


151 


GCT 


ATG 


GTT CTG 


AGG 


GGA 


ATG 


GCT 


AAG 


GAT 


GGG 


CAA 


TTT 


GCT 


TTG 


AAG 


w- 


-> 


152 


864 


































153 


Ala 


Met 


Val Leu 


Arg 


Gly 


Met 


Ala 


Lys 


Asp 


Gly 


Gin 


Phe 


Ala 


Leu 


Lys 


w- 


-> 


154 






275 








280 










285 








w- 


-> 


156 


GAT 


GCT 


GCT GCT 


GCT 


CAT 


GAA 


GGG 


ACT 


GTT 


AAG 


AAT 


GCT 


GTT 


GAT 


ATA 


w- 


-> 


157 


912 


































158 


Asp 


Ala 


Ala Ala 


Ala 


His 


Glu 


Gly 


Thr 


Val 


Lys 


Asn 


Ala 


Val 


Asp 


He 


w- 


-> 


159 




290 








295 










300 










w- 


-> 


161 


ATA 


AAG 


GCT GCT 


GCG 


GAA 


GCT 


GCA 


AGT 


GCT 


GCA 


AGT 


GCT 


GCT 


ACT 


GGT 


w- 


-> 


162 


960 


































163 


lie 


Lys 


Ala Ala 


Ala 


Glu 


Ala 


Ala 


Ser 


Ala 


Ala 


Ser 


Ala 


Ala 


Thr 


Gly 


w- 


-> 


164 


305 








310 










315 










320 


w- 


-> 


166 


AGT 


GCA 


GCA ATT 


GGG 


GAT 


GTT 


GTT 


AAT 


GGT 


AAT 


GGA 


GCA 


ACA 


GCA 


AAA 


w- 


-> 


167 


1008 
































168 


Ser 


Ala 


Ala He 


Gly 


Asp 


Val 


Val 


Asn Gly 


Asn 


Gly 


Ala 


Thr 


Ala 


Lys 


w- 


-> 


169 








325 










330 










335 




w- 


-> 


171 


GGT 


GGT 


GAT GCG 


AAG 


AGT 


GTT 


AAT 


GGC 


ATT 


GCT 


AAG 


GGA 








w- 


-> 


172 


1047 
































173 


Gly Gly 


Asp Ala 


Lys 


Ser 


Val 


Asn 


Gly 


He 


Ala 


Lys 


Gly 








E- 


-> 


174 






340 










345 




















254 


(2) 


INFORMATION 


FOR 


SEQ 


ID NO: 3: 




















256 




(i) 


SEQUENCE CHARACTERISTICS: 




















257 






(A) LENGTH: 283 base pairs 


















258 






(B) TYPE: 


nucleic 


acid 




















259 






(C) STRANDEDNESS : 


double 




















260 






(D) TOPOLOGY: 


unknown 






















262 




(ii) 


MOLECULE TYPE: 


DNA 


(genomic) 


















267 




(xi) 


SEQUENCE DESCRIPTION: SEQ ID NO: 3: 












E- 


-> 


269 


GCCGCTGGAT GGTGGTGAGA AGCAGGGTAA GAAGGCTGCG 


GATCGTAC2U^ ATCCCATTGA 
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RAW SEQUENCE LISTING 

PATENT APPLICATION: US/09/4 4 5,803 



DATE: 05/12/2000 
TIME: 11:04:23 



Input Set : A:\sequence.lst.txt 

Output Set: N:\CRF3\05122000\I445803.raw 



W--> 270 60 

E--> 272 CGCGGCTATT GGGGGTGCGG GTGATAATGA TGCTGCTGCG GCGTTTGCTA CTATGAAGAA 
W--> 273 120 

E--> 275 GGATGATCAG ATTGCTGCTG CTATGGTTCT GAGGGGAATG GCTAAGGATG GGCAATTTGC 
W--> 276 180 

E--> 278 TTTGAAGGAT GCTGCTGCTG CTCATGAAGG GACTGTTAAG AATGCTGTTG ATATAATAAA 
W--> 279 240 

E--> 281 GGCTGCTGCG GAAGCTGCAA GTGCTGCAAG TGCTGCTACT GGT 
E--> 282 283 

284 (2) INFORMATION FOR SEQ ID NO : 4: 

286 (i) SEQUENCE CHARACTERISTICS: 

287 (A) LENGTH: 233 base pairs 

288 {B) TYPE: nucleic acid 

289 (C) STRANDEDNESS : double 

290 (D) TOPOLOGY: unknown 

292 (ii) MOLECULE TYPE: DNA (genomic) 

297 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

E--> 299 TTTATTATAT CAACAGATTC TTAACAGTCC CTTCATGAGC AGCAGCAGCA TCCTTCAAAG 
W--> 300 60 

E--> 302 CAAATTGCCC ATCCTTAGCC ATTCCCCTCA GAACCATAGC AGCAGCAATC TGATCATCCT 
W--> 303 120 

E--> 305 TCTTCATAGT AGCAAACGCC GCAGCAGCAT CATTATCACC CGCACCCCCA ATAGCCGCGT 
W--> 306 180 

E--> 308 CAATCGGATT TGTACGATCC GCAGCCTTCT TACCCTGCTT CTCACCACCA TCC 
E--> 309 233 

311 (2) INFORMATION FOR SEQ ID NO: 5: 

313 (i) SEQUENCE CHARACTERISTICS: 

314 (A) LENGTH: 194 base pairs 

315 (B) TYPE: nucleic acid 

316 (C) STRANDEDNESS: double 

317 (D) TOPOLOGY: unknown 

319 (ii) MOLECULE TYPE: DNA (genomic) 

324 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

E--> 326 CCGTGCAAGC TGGGTTGAAG AAGGTTGGGG ATGTTGTTAA GAATAGTGAG GCAAAAGATG 
W--> 327 60 J 

E--> 329 GTGATGCGGC GAGTGTTAAT GGGATTGCTA AGGGGATAAA GGGGATTGTT GATGCTGCTG 



E--> 332 AGAAGGCTGA TGCGAAGGAA GGGAAGTTGG TATGTGGCTG GTGCTGCTGG TGAAACTAAC 
W--> 333 180 

E--> 33 5 AAGGAAGCGG CCGC 
E--> 336 194 

338 (2) INFORMATION FOR SEQ ID NO: 6: 

340 (i) SEQUENCE CHARACTERISTICS: 

341 (A) LENGTH: 369 base pairs 

342 (B) TYPE: nucleic acid 

343 (C) STRANDEDNESS: double 

344 (D) TOPOLOGY: unknown 

346 (ii) MOLECULE TYPE: DNA (genomic) 

3 51 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6; 



W--> 330 120 
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RAW SEQUENCE LISTING DATE: 05/12/2000 

PATENT APPLICATION: US/09/445,803 TIME: 11:04:23 

Input Set : A:\sequence.lst.txt 

Output Set: N:\CRF3\05122000\I44 5803.raw 



E- 


-> 


353 


GCGGCCGCTT GAGGAAGCTG CAAGTGCTGC AAGTGCTGCT 


ACTGGTAATG 


CAGCGATTGG 


w- 


-> 


354 


60 






E- 


-> 


356 


GGATGTTGTT AAGAATAGTG GGGCAGCAGC AAAAGGTGGT 


GAGGCGGCGA 


GTGTTAATGG 


W- 


-> 


357 


120 






E- 


-> 


359 


GATTGCTAAG GGGATAAAGG GGATTGTTGA TGCTGCTGGA 


AAGGCTGATG 


CGAAGGAAGG 


W- 


-> 


360 


180 






E- 


-> 


362 


GAAGTTGGAT GCTACTGGTG CTGAGGGTAC GACTAACGTG 


AATGCTGGGA 


AGTTGTTTGT 


W- 


-> 


363 


240 






E- 


-> 


365 


GAAGAGGGCG GCTGATGATG GTGGTGATGC AGATGATGCT 


GGGAAGGCTG 


CTGCTGCGGT 


W- 


-> 


366 


300 






E- 


-> 


368 


TGCTGCAAGT GCTGCTACTG GTAATGCAGC GATTGGAGAT 


GTTGTTAATG 


GTGATGTGGC 


W- 


-> 


369 


360 






E- 


-> 


371 


AAAACAAAA 






E- 


-> 


372 


369 










374 


(2) INFORMATION FOR SEQ ID NO: 7: 










376 


(i) SEQUENCE CHARACTERISTICS: 










377 


(A) LENGTH: 142 base pairs 










378 


(B) TYPE: nucleic acid 










379 


(C) STRANDEDNESS : double 










380 


(D) TOPOLOGY: unknown 










382 


(ii) MOLECULE TYPE: DNA (genomic) 










387 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7: 






E- 


-> 


389 


AAGGATGGTG ATGATAAGCA GGGTAAGAAG GCTGAGGATG 


CTACAAATCC 


GATTGACGCG 


W- 


-> 


390 


60 






E- 


-> 


392 


GCTATTGGGG GTGCAGGTGC GGGTGCTAAT GCTGCTGCGG 


CGTTTAATAA 


TATGAAGAAG 


W- 


-> 


393 


120 






E- 


-> 


395 


GATGATCAGA TTGAGCGGCC GC 






E- 


-> 


396 


142 










398 


(2) INFORl-iATION FOR SEQ ID NO; 8: 










400 


(i) SEQUENCE CHARACTERISTICS: 










401 


(A) LENGTH: 210 base pairs 










402 


(B) TYPE: nucleic acid 










403 


(C) STRANDEDNESS: double 










404 


(D) TOPOLOGY: unknown 










406 


(ii) MOLECULE TYPE: DNA (genomic) 










411 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 






E- 


-> 


413 


GGTGAAACTA ACAAGGATGC TGGGAAGTTG TTTGTGAAGA 


AGAATGGTGA 


TGATGGTGGT 


W- 


- > 


414 


60 






E- 


-> 


416 


GATGCAGGTG ATGCTGGGAA GGCTGCTGCT GCGGTTGCTG 


CTGTTAGTGG 


GGAGCAGATA 


W- 


-> 


417 


120 






E- 


-> 


419 


TTAZyUVGCGA TTGTTGATGC TGCTA2^GAT GGTGATAAGA 


CGGGGGTTAC 


TGATGTAAAG 


W- 


- > 


4 20 


180 






E- 


-> 


422 


GATGCTACAA ATCCGATTGA CGCGGCTATT 






E- 


-> 


423 


210 










425 


(2) INFORMATION FOR SEQ ID NO: 9: 










427 


(i) SEQUENCE CHARACTERISTICS: 










428 


(A) LENGTH: 236 base pairs 










429 


(B) TYPE: nucleic acid 










4 30 


(C) STRANDEDNESS: double 







! 
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RAW SEQUENCE LISTING DATE: 05/12/2 000 

PATENT APPLICATION; US/09/44 5,803 TIME: 11:04:23 

Input Set : A:\sequence.lst:. txt 
Output Set: N:\CRF3\05122000\l445803.raw 

4 31 (D) TOPOLOGY: unknown 

4 33 (ii) MOLECULE TYPE: DNA (genomic) 

4 38 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: A aJL^ 

E--> 4 40 TATATAATAA AGGCTGCTGC GAAGCTGCAA GTGCTGCAAG TGCTGCTACT GGTAGTGCAG /lyih^'^^^^'^^ 
W--> 441 60 / J^^ 

E--> 443 CAATTGGGGA TGTTGTTAAT GGTAATGGAG CAACAGCAAA AGGTGGTGAT GCGAAGTGTT 
W--> 444 120 

E--> 4 46 AATGGGATTG CTAAGGGGAT AAAGGGGATT GTTGATGCTG CTGAGAAGGC TGATGCGAAG 
W--> 447 180 

E--> 449 GAAGGGAAGT TGGATGTGGC TGGTGATGCT GGTGAAACTA ACAAGGAAGC GGCCGC 
E--> 450 236 

4 52 (2) INFORMATION FOR SEQ ID NO: 10: 

4 54 (i) SEQUENCE CHARACTERISTICS: 

455 (A) LENGTH: 199 base pairs 

456 (B) TYPE: nucleic acid 
4 57 (C) STRANDEDNESS : double 
4 58 (D) TOPOLOGY: unknown 
4 60 (ii) MOLECULE TYPE: DNA (genomic) 
465 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

E--> 4 67 ATGAGAGGAT CTCATCACCA TCACCATCAC ACGGATCCCC CGGGCTGCAG GAATTCGCGG 
W--> 468 60 

E--> 470 CCGCTGAAGG CTGATGCGAA GGAAGGGAAG TTGGATGTGG CTGGTGCTGC TGGTGAAACT 
W--> 471 120 

E--> 4 73 AACAAGGATG CTGGGAAGTT GTTTGTGAAG AAGAATAATG AGGGTGGTGA AGCAAATGAT 
W--> 474 180 

E--> 4 76 GCTGGGAAGG CTGCTGCTG 
E--> 477 199 

479 (2) INFORMATION FOR SEQ ID NO: 11: 

4 81 (i) SEQUENCE CHARACTERISTICS: 

482 (A) LENGTH: 272 base pairs 

483 (B) TYPE: nucleic acid 

484 (C) STRANDEDNESS: double 
4 85 (D) TOPOLOGY: unknown 
487 (ii) MOLECULE TYPE: DNA (genomic) 
4 92 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

E--> 4 94 GCCGCTGGAT GATCAGATTG CTGCTGCTAT GGTTGTGAGG GGAATGGCTA AGGATGGGCA 
W--> 495 60 

E--> 4 97 GTTTGCTTTG AAGGATGATG CTGCTAAGGA TGGAGATAAA ACGGGGGTTG CTGCGGATGT 
W--> 498 120 

E--> 500 GAAAATCCGA TTGACGCGGC TATTGGGGGT GCGGATGCTG ATGCTGCGGC GTTTAATAAG 
W--> 501 180 

E--> 503 GAGGGGATGA AGAAGGATGA TCAGATTGCT GCTGCTATGG TTCTGAGGGG AATGGCTAAG 
W--> 504 240 

E--> 506 GATGGGCAGT TTGCTTTGAC GAATAATGCT GC 
E--> 507 272 

509 (2) INFORMATION FOR SEQ ID NO : 12: 

511 (i) SEQUENCE CHARACTERISTICS: 

512 (A) LENGTH: 289 base pairs 

513 (B) TYPE: nucleic acid 
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514 






515 






517 






522 


E- 


-> 


524 


W- 


- > 


525 


E- 


_ > 


527 


W- 




528 


E - 


- > 


^ irt 
D jU 


W- 


- > 


531 


E- 


- > 


533 


W- 


- > 


534 


E- 


-> 


536 


E- 


-> 


537 






539 






541 






542 






543 






544 






54 5 






547 






J DU 






5 51 






552 






554 






555 






556 






559 


E- 


-> 


561 


W- 


-> 


562 






563 


w- 


-> 


564 


w- 


-> 


566 


w- 


-> 


567 






568 


w- 


-> 


569 


w- 


-> 


571 


w- 


-> 


572 






573 


E- 


-> 


574 



RAW SEQUENCE LISTING DATE: 05/12/2000 

PATENT APPLICATION: US/09/445,803 TIME: 11:04:23 

Input Set : A:\sequence .1st .txt 

Output Set : N:\CRF3\05122000\I4 45803 .raw 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 



60 



i 



(2) INFORMATION FOR SEQ ID NO: 13: 
(i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 142 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: unknown 
(ii) MOLECULE TYPE: CDNA 
(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 2 . .142 
(ix) FEATURE: 

(A) NAME/KEY: mat_peptide 

(B) LOCATION: 2 

(xi) SEQUENCE DESCRIPTION; SEQ ID NO: 13: 
G CCG CTT ACA AAT CCG ATT GAC GCG GCT ATT GGG GGG AGT GCG GAT 
46 

Pro Leu Thr Asn Pro lie Asp Ala Ala lie Gly Gly Ser Ala Asp 
1 5 .10 15 



20 25 30 



35 40 45 
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VERIFICATION SUMMARY 

PATENT APPLICATION: US/09/445,803 



DATE: 05/12/2000 
TIME: 11:04:24 



L 


: 66 


M: 


334 


W: 


L 


: 67 


M: 


336 


W: 


L 


: 69 


M: 


336 


W: 


L 


: 71 


M: 


334 


W: 


L 


: 72 


M: 


336 


W: 


L 


: 74 


M: 


336 


W: 


L 


: 76 


M: 


334 


W: 


L 


:77 


M: 


336 


W: 


L 


: 79 


M: 


336 


W: 


L 


:81 


M: 


334 


W: 


L 


:82 


M: 


336 


W: 


L 


:84 


M: 


336 


W: 


L 


:86 


M: 


334 


W: 


L: 


:87 


M: 


336 


W: 


L; 


:89 


M: 


336 


W: 


L: 


:91 


M: 


334 


W: 


L; 


:92 


M: 


336 


W: 


L: 


:94 


M: 


336 


W: 


L: 


:96 


M: 


334 


W: 


L: 


:97 


M: 


336 


W: 


L: 


:99 


M: 


336 


W: 


L: 


:101 


M 


:334 


W 


L; 


:102 


M 


:336 


W 


L: 


:104 


M 


:336 


W 


L; 


:106 


M 


: 334 


W 


L; 


:107 


M 


:336 


W 


L: 


:109 


M 


:336 


W 


L: 


:111 


M 


:334 


W 


L: 


;112 


M 


:336 


W 


L: 


114 


M 


:336 


W 


L: 


116 


M 


:334 


W 


L: 


117 


M 


:336 


W 


L: 


119 


M 


:336 


W 


L: 


121 


M 


: 334 


W 


L: 


122 


M 


:336 


W 


L: 


124 


M 


:336 


W 


L: 


126 


M 


:334 


W 


L: 


127 


M 


:336 


W 


L: 


129 


M 


:336 


W 


L: 


131 


M 


: 334 


W 


L: 


132 


M 


:336 


W 


L: 


134 


M 


:336 


W 


L: 


136 


M 


: 334 


W 


L: 


137 


M 


:336 


W 


L: 


139 


M 


:336 


W 


L: 


141 


M 


:334 


W 


L: 


142 


M 


:336 


W; 


L: 


144 


M 


:336 


W; 



Input Set : A:\sequence .1st. txt 

Output Set: N:\CRF3\05122000\I445803.raw 

(2) Invalid Amino Acid in Coding Region, NUMBER OF 
Invalid Amino Acid Number in Coding Region, SEQ ID: 
Invalid Amino Acid Number in Coding Region, SEQ ID: 
(2) Invalid Amino Acid in Coding Region, NUMBER OF 
Invalid Amino Acid Number in Coding Region, SEQ ID: 
Invalid Amino Acid Number in Coding Region, SEQ ID: 
(2) Invalid Amino Acid in Coding Region, NUMBER OF 
Invalid Amino Acid Number in Coding Region, SEQ ID: 
Invalid Amino Acid Number in Coding Region, SEQ ID: 
(2) Invalid Amino Acid in Coding Region, NUMBER OF 
Invalid Amino Acid Number in Coding Region, SEQ ID: 
Invalid Amino Acid Number in Coding Region, SEQ ID: 
(2) Invalid Amino Acid in Coding Region, NUMBER OF 
Invalid Amino Acid Number in Coding Region, SEQ ID: 
Invalid Amino Acid Number in Coding Region, SEQ ID: 
(2) Invalid Amino Acid in Coding Region, NUMBER OF 
Invalid Amino Acid Number in Coding Region, SEQ ID: 
Invalid Amino Acid Number in Coding Region, SEQ ID: 
(2) Invalid Amino Acid in Coding Region, NUMBER OF 
Invalid Amino Acid Number in Coding Region, SEQ ID: 
Invalid Amino Acid Number in Coding Region, SEQ ID: 
(2) Invalid Amino Acid in Coding Region, NUMBER OF 
Invalid Amino Acid Number in Coding Region, SEQ ID 
Invalid Amino Acid Number in Coding Region, SEQ ID 
(2) Invalid Amino Acid in Coding Region, NUMBER OF 
Invalid Amino Acid Number in Coding Region, SEQ ID 
Invalid Amino Acid Number in Coding Region, SEQ ID 
(2) Invalid Amino Acid in Coding Region, NUMBER OF 
Invalid Amino Acid Number in Coding Region, SEQ ID 
Invalid Amino Acid Number in Coding Region 
(2) Invalid Amino Acid in Coding Region 
Invalid Amino Acid Number 
Invalid Amino Acid Number 



SEQ ID 
NUMBER OF 
in Coding Region, SEQ ID 
in Coding Region, SEQ ID 
(2) Invalid Amino Acid in Coding Region, NUMBER OF 
Invalid Amino Acid Number in Coding Region, SEQ ID 
Invalid Amino Acid Number in Coding Region, SEQ ID 
(2) Invalid Amino Acid in Coding Region, NUMBER OF 
Invalid Amino Acid Number in Coding Region, SEQ ID 
Invalid Amino Acid Number in Coding Region, SEQ ID 
(2) Invalid Amino Acid in Coding Region, NUMBER OF 
Invalid Amino Acid Number in Coding Region, SEQ ID 
Invalid Amino Acid Number in Coding Region, SEQ ID 
(2) Invalid Amino Acid in Coding Region, NUMBER OF 
Invalid Amino Acid Number in Coding Region, SEQ ID 
Invalid Amino Acid Number in Coding Region, SEQ ID 
(2) Invalid Amino Acid in Coding Region, NUMBER OF 
Invalid Amino Acid Number in Coding Region, SEQ ID 
Invalid Amino Acid Number in Coding Region, SEQ ID 



INVALID KEYS: 16 
1 
1 

INVALID KEYS: 16 
1 
1 

INVALID KEYS: 16 
1 
1 

INVALID KEYS: 16 
1 
1 

INVALID KEYS: 16 
1 
1 

INVALID KEYS: 16 
1 
1 

INVALID KEYS: 16 
1 
1 

INVALID KEYS: 16 
1 
1 

INVALID KEYS: 16 
1 
1 

INVALID KEYS: 16 
1 
1 

INVALID KEYS: 16 
1 
:1 

INVALID KEYS; 16 
:1 
:1 

INVALID KEYS: 16 
:1 
:1 

INVALID KEYS: 16 
:1 
:1 

INVALID KEYS: 16 
:1 
:1 

INVALID KEYS: 16 
:1 
:1 
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VERIFICATION SUMMARY 

PATENT APPLICATION; US/09/44 5,803 



DATE: 05/12/2000 
TIME: 11:04:24 



Input Set : 
Output Set: 



A: \sequence .1st . txt 
N:\CRF3\05122000\l445803.raw 



L 


146 


M 


334 


W: 


L 


147 


M 


336 


W: 


L 


149 


M 


336 


W: 


L 


151 


M 


334 


W: 


L 


152 


M 


336 


W: 


L 


154 


M 


336 


w- 


L 


156 


M 


334 


W 


L 


157 


M 


336 


W 


L 


159 


M 


336 


W 


L 


161 


M 


334 


W 


L 


162 


M 


336 


W 


L 


164 


M 


336 


W 


L 


166 


M 


334 


W 


L 


167 


M 


336 


W 


L 


169 


M 


336 


W 


L 


171 


M 


334 


w- 


L 


172 


M 


336 


w- 


L 


174 


M 


336 


W: 


L 


174 


M 


204 


E: 


L 


269 


M 


334 


w- 


L 


269 


M 


333 


E: 


L 


270 


M 


336 


W: 


L 


272 


M 


334 


W: 


L 


272 


M 


333 


E. 


L 


273 


M 


336 


W: 


L 


275 


M 


334 


W: 


L 


275 


M 


333 


E: 


L 


276 


M 


336 


W: 


L 


278 


M 


334 


W: 


L 


278 


M 


333 


E: 


L 


279 


M 


336 


W: 


L 


281 


M 


334 


W: 


L 


281 


M 


333 


E: 


L 


282 


M 


336 


W: 


L 


282 


M 


204 


E: 


L 


299 


M 


334 


W: 


L 


299 


M 


333 


E: 


L 


300 


M 


336 


W: 


L 


302 


M 


334 


W: 


L 


302 


M 


333 


E: 


L 


305 


M 


334 


W: 


L 


305 


M 


333 


E: 


L 


308 


M 


334 


W: 


L 


308 


M 


333 


E: 


L 


309 


M 


204 


E: 


L 


326 


M 


334 


W: 


L 


326 


M 


333 


E: 


L 


329 


M 


334 


W: 


L 


329 


M. 


333 


E: 



INVALID KEYS 
:1 
:1 



16 



(2) Invalid Amino Acid in Coding Region, NUMBER OF INVALID KEYS: 16 
Invalid Amino Acid Number in Coding Region, SEQ ID:1 
Invalid Amino Acid Number in Coding Region, SEQ ID:1 
(2) Invalid Amino Acid in Coding Region, NUMBER OF INVALID KEYS: 16 
Invalid Amino Acid Number in Coding Region, SEQ ID:1 
Invalid Amino Acid Number in Coding Region, SEQ ID:1 
(2) Invalid Amino Acid in Coding Region, NUMBER OF INVALID KEYS: 16 
Invalid Amino Acid Number in Coding Region, SEQ ID:1 
Invalid Amino Acid Number in Coding Region, SEQ ID; 
(2) Invalid Amino Acid in Coding Region, NUMBER OF 
Invalid Amino Acid Number in Coding Region, SEQ ID: 
Invalid Amino Acid Number in Coding Region, SEQ ID: 
(2) Invalid Amino Acid in Coding Region, NUMBER OP INVALID KEYS: 16 
Invalid Amino Acid Number in Coding Region, SEQ ID:1 
Invalid Amino Acid Number in Coding Region, SEQ ID:1 
(2) Invalid Amino Acid in Coding Region, NUMBER OF INVALID KEYS: 13 
Invalid Amino Acid Number in Coding Region, SEQ ID:1 
Invalid Amino Acid Number in Coding Region, SEQ ID:1 
(24) Calc# of Bases differs from actual, LENGTH : Input : 1047 Counted: 
(2) Invalid Amino Acid in Coding Region, NUMBER OF INVALID KEYS : 6 
Wrong sequence grouping, Amino acids not in groups! 
Invalid Amino Acid Number in Coding Region, SEQ ID: 3 
(2) Invalid Amino Acid in Coding Region, 
Wrong sequence grouping, Amino acids not 
Invalid Amino Acid Number in Coding Region, SEQ ID: 
(2) Invalid Amino Acid in Coding Region, NUMBER OF INVALID KEYS : 6 
Wrong sequence grouping, Amino acids not in groups' 
Invalid Amino Acid Number in Coding Region, SEQ ID; 3 
(2) Invalid Amino Acid in Coding Region, NUMBER OF INVALID KEYS : 6 
Wrong sequence grouping, Amino acids not in groups! 
Invalid Amino Acid Number in Coding Region, SEQ ID; 3 
(2) Invalid Amino Acid in Coding Region, NUMBER OF INVALID KEYS : 5 
Wrong sequence grouping, Amino acids not in groups! 
Invalid Amino Acid Number in Coding Region, SEQ ID; 3 
(24) Calc# of Bases differs from actual, LENGTH : Input : 283 Counted:0 
(2) Invalid Amino Acid in Coding Region, NUMBER OF INVALID KEYS : 6 
Wrong sequence grouping, Amino acids not in groups! 
Invalid Amino Acid Number in Coding Region, SEQ ID: 4 
(2) Invalid Amino Acid in Coding Region, NUMBER OF INVALID KEYS : 6 
Wrong sequence grouping, Amino acids not in groups' 
(2) Invalid Amino Acid in Coding Region, 



NUMBER OF INVALID KEYS: 
in groups ! 

: 3 



NUMBER OF INVALID KEYS : 6 



Wrong sequence grouping, Amino acids not in groups! 
(2) Invalid Amino Acid in Coding Region, NUMBER OF INVALID KEYS : 6 
Wrong sequence grouping, Amino acids not in groups' 
(24) Calc# of Bases differs from actual, 
(2) Invalid Amino Acid in Coding Region, 
Wrong sequence grouping, Amino acids not 
(2) Invalid Amino Acid in Coding Region, 



LENGTH: Input: 23 3 Counted: 
NUMBER OF INVALID KEYS : 6 
in groups ! 

NUMBER OF INVALID KEYS : 6 



Wrong sequence grouping, Amino acids not in groups! 
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VERIFICATION SUMMAEY 

PATENT APPLICATION: US/09/44 5,803 



DATE: 05/12/2000 
TIME: 11:04:24 



Input Set : A:\sequence.lst.txt 

Output Set: N:\CRF3\05122000\l445803.raw 



L:332 M:334 W: (2) Invalid Amino Acid in Coding Region, NUMBER OF INVALID KEYS: 6 
L:332 M:333 E: Wrong sequence grouping, Amino acids not in groups! 
L:335 M:334 W: (2) Invalid Amino Acid in Coding Region, NUMBER OF INVALID KEYS : 2 
L:3 35 M:333 E: Wrong sequence grouping, Amino acids not in groups! 

L:336 M:204 E: (24) Calc# of Bases differs from actual, LENGTH : Input : 194 Counted :0 

L:353 M:334 W: (2) Invalid Amino Acid in Coding Region, NUMBER OF INVALID KEYS : 6 

L:3 53 M:333 E: Wrong sequence grouping, Amino acids not in groups! 

L:356 M:334 W: (2) Invalid Amino Acid in Coding Region, NUMBER OF INVALID KEYS : 6 

L:356 M:333 E: Wrong sequence grouping, Amino acids not in groups! 

L:359 M:334 W: (2) Invalid Amino Acid in Coding Region, NUMBER OF INVALID KEYS : 6 

L:359 M:333 E: Wrong sequence grouping, Amino acids not in groups! 

L:362 M:334 W: (2) Invalid Amino Acid in Coding Region, NUMBER OF INVALID KEYS: 6 

L:362 M:333 E: Wrong sequence grouping, Amino acids not in groups! 

L:365 M:334 W: (2) Invalid Amino Acid in Coding Region, NUMBER OF INVALID KEYS : 6 

L:365 M:333 E: Wrong sequence grouping, Amino acids not in groups! 

L:368 M:334 W: (2) Invalid Amino Acid in Coding Region, NUMBER OF INVALID KEYS : 6 

L:368 M:333 E: Wrong sequence grouping, Amino acids not in groups! 

L:371 M:334 W: (2) Invalid Amino Acid in Coding Region, NUMBER OF INVALID KEYS : 1 

L:371 M:333 E: Wrong sequence grouping, Amino acids not in groups! 

L:372 M:204 E: (24) Calc# of Bases differs from actual, LENGTH : Input : 369 Counted : 0 

L:389 M:334 W: (2) Invalid Amino Acid in Coding Region, NUMBER OF INVALID KEYS : 6 

L:389 M:333 E: Wrong sequence grouping, Amino acids not in groups! 

L:392 M:334 W: (2) Invalid Amino Acid in Coding Region, NUMBER OF INVALID KEYS : 6 

L:392 M:333 E: Wrong sequence grouping, Amino acids not in groups! 

L:395 M:334 W: (2) Invalid Amino Acid in Coding Region, NUMBER OF INVALID KEYS : 3 

L:395 M:333 E: Wrong sequence grouping, Amino acids not in groups I 

L:396 M:204 E: (24) Calc# of Bases differs from actual, LENGTH : Input : 14 2 Counted : 0 

L:413 M:334 W: (2) Invalid Amino Acid in Coding Region, NUMBER OF INVALID KEYS : 6 

L:413 M:333 E: Wrong sequence grouping, Amino acids not in groups! 

L;416 M:334 W: (2) Invalid Amino Acid in Coding Region, NUMBER OF INVALID KEYS : 6 

L:416 M:333 E: Wrong sequence grouping, Amino acids not in groups! 

L:419 M:334 W: (2) Invalid Amino Acid in Coding Region, NUMBER OF INVALID KEYS : 6 

L:419 M:333 E: Wrong sequence grouping, Amino acids not in groups! 

L:422 M:334 W: (2) Invalid Amino Acid in Coding Region, NUMBER OF INVALID KEYS : 3 

L:422 M:333 E: Wrong sequence grouping, Amino acids not in groups! 

L:423 M:204 E: (24) Calc# of Bases differs from actual, LENGTH : Input : 210 Counted : 0 
L;440 M:334 W: (2) Invalid Amino Acid in Coding Region, NUMBER OF INVALID KEYS : 6 
L:440 M:333 E: Wrong sequence grouping, Amino acids not in groups! 
L:443 M:333 E: Wrong sequence grouping, Amino acids not in groups! 
L:446 M:333 E: Wrong sequence grouping, Amino acids not in groups! 
L:449 M:333 E: Wrong sequence grouping, Amino acids not in groups! 

L:450 M:204 E: (24) Calc# of Bases differs from actual, LENGTH : Input : 23 6 Counted :0 
L:467 M:333 E: Wrong sequence grouping, Amino acids not in groups! 
L:470 M:333 E: Wrong sequence grouping, Amino acids not in groups! 
L:473 M:333 E: Wrong sequence grouping, Amino acids not in groups! 
L:476 M:333 E: Wrong sequence grouping, Amino acids not in groups! 

L:477 M:204 E: (24) Calc# of Bases differs from actual, LENGTH : Input : 199 Counted : 0 
L:494 M:333 Ei Wrong sequence grouping, Amino acids not in groups! 
L:497 M:333 E: Wrong sequence grouping, Amino acids not in groups! 
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VERIFICATION SUMMARY 

PATENT APPLICATION; US/09/44 5,803 



DATE: 05/12/2000 
TIME: 11:04:24 



Input Set ; 
Output Set: 



A: \sequence . Ist . txt 
N:\CRF3\05122000\I445803.xaw 



L 


500 


M 


333 


E 


L 


503 


M 


333 


E 


L 


506 


M 


333 


E 


L 


507 


M 


204 


E 


L 


524 


M 


333 


E 


L 


527 


M 


333 


E 


L 


530 


M 


333 


E 


L 


533 


M 


333 


E 


L 


536 


M 


333 


E 


L 


537 


M 


204 


E 


L 


561 


M 


333 


E 


L 


574 


M 


204 


E 



Wrong sequence 
Wrong sequence 
Wrong sequence 
(24) Calc# of 
Wrong sequence 
Wrong sequence 
Wrong sequence 
Wrong sequence 
Wrong sequence 
(24) Calc# of 
Wrong sequence 
(24) Calc# of 



grouping 
grouping 
grouping 
Bases dif 
grouping 
grouping 
grouping 
grouping 
grouping 
Bases dif 
grouping 
Bases dif 



Amino acids not 
Amino acids not 
Amino acids not 

fers from actual, 
Amino acids not 
Amino acids not 
Amino acids not 
Amino acids not 
Amino acids not 

fers from actual, 
Amino acids not 

fers from actual, 



in groups : 
in groups ! 
in groups i 

LENGTH: Input : 272 Counted : 0 

in groups ! 

in groups : 

in groups ! 

in groups ! 

in groups ! 

LENGTH: Input: 289 Counted; 0 
in groups ! 

LENGTH: Input: 142 Counted :0 
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